A Framework For Supporting Load Shedding in Data Stream Management Systems
نویسندگان
چکیده
The arrival rate of tuples in a data stream can be unpredictable and bursty. Many stream-based applications have Quality of Service (QoS) requirements that need to be satisfied by the underlying stream processing system. In order to avoid violating predefined QoS requirements during temporary overload periods, a load shedding strategy is necessary and critical for a data stream management system. In this paper, we propose a framework and techniques for a general load shedding strategy by dynamically inserting load shedders into query plans or deactivating existing shedders based on the estimation of current system load. These shedders can drop tuples in either a randomized manner or using user-specified application semantics. Specifically, we address the problem of predicting the system load which implicitly determines when load shedding is needed and how much. We also address the problem of determining the optimal placement of a shedder in a query plan, and the problem of how to distribute the total number of tuples to be dropped among all shedders with the goal of minimizing the total relative errors in the final query results due to load shedding. Finally, we have conducted extensive experiments to validate the effectiveness and efficiency of the proposed load shedding techniques.
منابع مشابه
Content-based Load Shedding in Multimedia Data Stream Management System
Overload management has become very important in public safety systems that analyse high performance multimedia data streams, especially in the case of detection of terrorist and criminal dangers. Efficient overload management improves the accuracy of automatic identification of persons suspected of terrorist or criminal activity without requiring interaction with them. We argue that in order t...
متن کاملStochastic Congestion Alleviation with a Trade-off between Demand Response Resources and Load Shedding
Under the smart grid environments, Demand Response Resources (DRRs) as power system resources can effectively participate in and improve performance of electric systems. Congestion management is one of the technical challenges in which DRRs can play a significant role. Previously, congestion management is applied without considering the power system uncertainties. Therefore, a stochastic conges...
متن کاملHow to Screen a Data Stream - Quality-Driven Load Shedding in Sensor Data Streams
As most data stream sources exhibit bursty data rates, data stream management systems must recurrently cope with load spikes that exceed the average workload to a considerable degree. To guarantee low-latency processing results, load has to be shed from the stream, when data rates overstress system resources. There exist numerous load shedding strategies to delete excess data. However, the cons...
متن کاملMining Databases and Data Streams with Query Languages and Rules
Among data-intensive applications that are beyond the reach of traditional Data Base Management Systems (DBMS), data mining stands out because of practical importance and the complexity of the research problems that must be solved before the vision of Inductive DBMS can become a reality. In this paper, we first discuss technical developments that have occurred since the very notion of Inductive...
متن کاملAnatomy of a Data Stream Management System
In this paper, we identify issues and present solutions developed – both theoretical and experimental – during the course of developing a data stream management system (DSMS) for applications in a sensor environment. Specifically, we summarize our solutions for CQ processing, trigger mechanisms, and Quality of Service (QoS) management in a stream data processing system. Specifically, we first p...
متن کامل